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Abstract. We introduce qustochastic matrices as the bistocha- 
stic matrices arising from quaternionic unitary matrices by repla- 
cing each entry with the square of its norm. This is the quaterni- 
onic analogue of the unistochastic matrices studied by physicists. 
We also introduce quaternionic Hadamard matrices and quater- 
nionic mutually unbiased bases (MUB). In particular we show that 
the number of MUB in an n-dimensional quaternionic Hilbert space 
is at most 2n + 1. The bound is attained for n — 2. We also de- 
termine all quaternionic Hadamard matrices of size n < 4. 
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1. Introduction 

Let B n be the set of n by n bistochastic (also known as doubly sto- 
chastic) matrices, i.e., real matrices all of whose entries are nonnegative 
and row and column sums are identically 1. The set B n is also known 
as the Birkhoff polytope; it is exactly the convex hull of the n by n 
permutation matrices. We shall use the standard notation O(n), U(n) 
and Sp(n) for the classical compact Lie groups: the real orthogonal, 
complex unitary and the quaternionic unitary (symplectic) group re- 
spectively. Let (f) r : O(n) — > A n , 4> c '■ U(n) — > A n and <fih '■ Sp(ra) — > A n 
be the maps given by (fi(Z) it j = \Zij\ 2 where A n is the affine space 
spanned by B n . The images of the first two maps are called the or- 
thostochastic and unistochastic matrices respectively. We introduce the 
term qustochastic for matrices belonging to the image of <fih- We warn 
the reader that the term "orthostochastic" has been used in the past 
by mathematicians, e.g. in [Tj|, to designate the class of bistochastic 
matrices to which we refer now as "unistochastic" . 

We will use the notation O n , lA n and Q n for the n by n orthostochas- 
tic, unistochastic and qustochastic matrices, respectively. Clearly we 
have O n C U n C Q n C B n . If n = 2 all of these are equal. It is known 
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that O n C U n C B n for n > 3. It is easy to see that U 3 = Q 3 and we 
will show that U n C Q n C £> n for n > 3. 

To dephase a complex matrix means to multiply it on the left and 
right by unitary diagonal matrices so that the uppermost row and the 
leftmost column become both real and nonnegative. We say that two 
matrices are equivalent if one can be obtained from the other by multi- 
plication on the left and right by unitary monomial matrices. Similar 
definitions apply to the real and quaternion cases using real orthogo- 
nal and symplectic monomial matrices respectively Note that in the 
quaternion case these operations allow the conjugation of all entries 
by a fixed unit norm quaternion. A class of equivalent matrices under 
the above relation may contain more than one dephased form. For 
example the dephased form may change under a permutation of rows 
and columns. If a is a quaternion then we write it in coordinates with 
respect to the standard basis as a = a\ + e^i + a 3 ] + aji. We use the 
notation a pu for the pure part of a. 

In section |5] we determine the singular points of the map 4> r for n < 4 
and the critical points of 4> c and (f)h for n < 3. These critical points are 
important if one wants to determine the boundaries of U n and O n in 
A n . Indeed, if Z G U(n) and 4> C {.Z) G dU n , the boundary of U n , then 
Z is necessarily a critical point of C . 

The van der Waerden matrix J n G B n , all of whose entries are equal 
to 1/n, is the barycenter of B n . We have that J n G U n for all n but 
J n G O n if and only if there exists a real Hadamard matrix of order 
n. As J 3 G^ O3 it is an interesting question to find the distance from 
J3 to O3 (see Proposition 13.21) . We also consider criteria for certain 
bistochastic matrices to be orthostochastic. It is easy to construct a 
system £„ of n(n — 1) polynomial equations that must be satisfied 
by an n by n orthostochastic matrix. We determine that for n > 16 
this system is not sufficient to characterize the set of orthostochastic 
matrices. This is done in section [31 

The matrices A = \fnZ where Z G U(n) and <fi c {Z) = J n are the 
complex Hadamard matrices. They satisfy A* A = nl n and \Aij\ = 1 
for all i, j. In section H] we define similarly the quaternionic Hadamard 
matrices. Thus, up to the scale factor y/n, the set of complex Hadamard 
matrices of order n can be identified with the fibre of (f) c above the point 

Jn- 

While the real Hadamard matrices have been studied for more than 
100 years, and the complex ones for several decades, the quaternionic 
ones have been totally neglected. Every quaternionic Hadamard ma- 
trix is equivalent to a complex one if n < 3, but not so for n > 4. 
In the case n = 4 we have constructed two families of quaternionic 
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Hadamard matrices: a generic one depending on three real parame- 
ters and a special one having only two parameters. Any quaternionic 
Hadamard matrix of order 4 is equivalent to a matrix belonging to one 
of these two families. 

In section|5]we extend the concept of mutually unbiased bases (MUB) 
from the case of an n-dimensional complex Hilbert space to the quater- 
nionic case. It is well known that in the complex case the number of 
MUB of C n does not exceed n+1 and that this bound is attained when 
n is a power of a prime number. In the quaternionic case this bound 
is no longer valid and we obtain the new bound 2n + 1 i.e., for the 
quaternionic Hilbert space H n . This new bound is attained for n = 2, 
i.e., for 2 by 2 quaternionic matrices. The case n = 3 is much harder. 

In order to construct MUB (A , Ai, . . . , Ak) consisting of 3 by 3 
symplectic matrices, we may assume without any loss of generality that 
Aq = 1%, and A\ = F% is the complex Fourier matrix. We have classified 
all possible members A 2 in such MUB by constructing a generic 3- 
parameter family and five 2-parameter families. Up to conjugation by 
a fixed complex number, \/3A 2 is equivalent to a matrix in one of these 
families. Using this we were able to verify that a set of complex MUB 
consisting of four bases (there is only one such set up to equivalence) is 
maximal i.e. cannot be extended to quaternionic MUB of larger size. 
We also find a 3-parameter set of four quaternionic MUB and show 
that they are all maximal. 

The more complicated calculations and proofs were verified with 
Maple [9J. Throughout the paper we occasionally remind the reader 
that a calculation was carried out on the computer. 



We will examine the critical points of the maps <ft c and <ph (and 
singular points of <p r ) in small dimensions n. The definition of singular 
points will be given later. For the tangent space T B A n of A n at B e A n 
we use the basis of matrices where 



2. Critical Points 



B 




1 if (k,l) = (i,j) or (n,n) 
-1 if (k,l) = (i,n) or (n,j) 



else 



and 1 < i, j < n — 1. 
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A k,l 



1 if (k,l) = (i,j) or (j,i) 
else 



To describe the various tangent spaces, let us introduce the following 
matrices A l ' j and C M by specifying their entries: 

1 if (k,l) = (i,j) 
-1 if (k,l) = (j,i) , Cg = 
else 

1 < % < j < n 1 < % < j < n. 

A basis of the tangent space T x O(ra) at X e O(n) is given by 

A^X, i < j. 

A basis of the tangent space T z U(n) at Z G U(n) is given by 

A iJ Z, i < j; iC^Z, i < j. 

Similarly a basis for the tangent space TVSp(n) at W G Sp(n) is given 
by 

A l ' J W, i<j; iC^W, jC^W, kC lJ W, i < j. 

Now 4> r is obtained by restricting the codomain of the composition f r oi 
where f r : M n (M.) — > M n (R) squares each entry and i : 0(n) — > M n (R) 
is the inclusion map. So we can find the differential (df r )x of / at X 
and then restrict to the tangent spaces to get the differential (d(f) r )x ■ 



TxO(n) 



An- 



With a suitable ordering of the basis vectors, the differential (df r )x 
is represented by the n 2 by n 2 diagonal matrix 2-diag(xi 5 i, x\ t 2, ■ 
In fact we have 



(2.1) 



(df r ) x (Y)=XoY, 



where X o Y denotes the Hadamard product of matrices X and Y. 
Explicitly, we have that 



X o (A l ' j X) = 















^i,2^j,2 ' ' ' •Ei,n%j,n 



%j,n%i,n 















where entries are zero except in the i th and j th rows. Then it is not 
difficult to see that 

n-l 

X o (A^X) = ^. /•,.,,-,,( - B*> k ) 



k=l 
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when j < n and 

n-l 

Io(#»I) = ^ffV»> 

fc=l 

These expressions will be used to give the matrix representation of 
(d<p r )x with respect to the described bases. 

In the complex case, the Jacobian of the map C — > K. given by 
Zij = Xij + yij'i h-> \zij\ 2 = x\j + yfj is the matrix 

2,Xij 

It sends the tangent vector Wij = Uij + Viji to 

~2x 



It follows that 



2Re(zijWi 
(df c ) z (X) = Re(ZoX) 



and consequently 

(2.2) (A<p c ) z (XZ) = Re(Z o (XZ)) 

for any skew-hermit ian matrix X and Z G U(n). If i — j we see that 

(d0 c ) z (iL7^Z) = 0. 

If W is symplectic we mention that the analog of (12.21) holds. 

Note that if one is to consider the Jacobian of 6 r or 0/j at a real 
orthogonal matrix, it is the Jacobian of <p r with extra columns of zeros 
appended and similarly for unitary matrices under 0^. Thus if X G 
0(n) then 

rank(d0/ l )x = rank(d0 c )x = rank(d0 r )x, 
and if Z G U(n) then 

(2.3) rank(d0 h ,) z = rank(d0 c ) z . 

We say that Z G U(ra) is a critical point of </> c if rank(d0 c )z < 
dim^4 n — (n — l) 2 . We define similarly the critical points of <ph- We 
say that X G 0(n) is a singular point of <p r if rank(d0 r )x < dimO(ra) = 
n(n — l)/2. Note that strictly speaking, a critical point of <p r is a point 
X G O(n) such that rank(d0 r )x < dim^4 n — (n — l) 2 . However 
since dimO(n) < dim^4 n for n > 3, we would have that all points are 
critical for n > 3. Thus for C and <ph a critical point is one where the 
differential is not surjective, while for <f> r we use the term singular point 
when the differential is not injective. 
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Remark 2.1. From formula (12. ip we see that (df r )x is an isomorphism 
if and only if X has no zero entries. In particular if X G O(n) is a 
singular point of <f> r , then X must have a zero entry. Hence the set of 
nonsingular points of p r is an open dense subset of O(n). 

The analogous question for p c (and ph) remains open, see Problem 



We say that a square matrix A splits if there exist permutation ma- 
trices P and Q such that PAQ is a direct sum of two square matrices 
of smaller size. The set of critical points is invariant under equivalence 
[T2"l 3.1 and 3.2]. We have the following fact: 

Theorem 2.2. If an orthogonal matrix splits, then it is a singular 
point of the map p> r . Similar statements hold for the critical points of 
the maps p c and (ph. 

Proof. Let X G O(n) split and since X is a singular point iff PXQ is 
a singular point for permutation matrices P and Q, we assume that X 
is a direct sum of square matrices of smaller size. Assume 1 < j < n is 
the smallest index such that X is the direct sum of a j by j block and 
an n — j by n — j block. Then X^j = for k > j, I < j and for k < j, 
I > j. Consider the basis vector A l 'i +1 X which has first row equal to 
the (j + l) s * row of X and has (j + l) st row equal to the negative of 
the first row of X and all other rows are zero. Now it is not difficult 
to see that the Hadamard product X o (y4 lj+1 X) is the zero matrix. 
Thus y4 lj+1 X G ker(d(p r )x and X is a singular point of the map <p r . 

If Z G U(n) splits we must show that rank(d0 c )z < (n— l) 2 . Again 
assume 1 < j < n is the smallest index such that Z is the direct 
sum of a j by j block and an n — j by n — j block. Then Zk,i = 
and using (12.21) we see that A h ' 1 G ker(d(p c )z for k > j, I < j and 
for k < j, I > j. Also {d(p c ) z (iC^Z) = for 1 < i < n. Thus 
dim ker(d0 c )z > 2(n — j)j + n > 2n and Z is a critical point. 

If W G Sp(n) then the proof is similar to the complex case using the 



Now we will find the singular (critical) points of <p r (<p c and ph) for 
n < 3. For a real or complex matrix X, we denote by 



its k by k minor in rows ii, . . . , i^ and columns ji, ■ ■ ■ ,jk- 

Theorem 2.3. Let n = 2. The singular (critical) points of p r (<p c and 
<ph) are the diagonal and anti-diagonal matrices. 



1631 



analog of Q . 



□ 
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X 



Proof. Let 

a?i,i a?i,2 

^2,1 ^2,2 

be a real orthogonal matrix. The Jacobian of r at X is the 1 by 1 
matrix [£1,1X2,1] ■ It is clear then that there are 8 critical points given 
by the matrices 



±1 


" 




" 


±1" 





±1 




±1 






It is not difficult to see that this reasoning carries over to the complex 
and quaternion cases since any 2 by 2 unitary or symplectic matrix is 
equivalent to a real orthogonal matrix. □ 



Theorem 2.4. Let n 

matrices that split. 



3. The singular points of the map <p r are the 



Proof. It is clear that any matrix that splits is a singular point. Let 
X = [xij] £ 0(3) be a singular point. The Jacobian of <p r at X is the 
4 by 6 matrix (three columns of zeros ignored): 



J 



£1,1X2,1 

^1,2^2,2 
-Zl,lX 2 ,l 

—£1,2X2,2 



£l, 1^3,1 
^1, 2^3,2 









£2, 1^3,1 

£2,2^3,2 



If X has two or more zeros it must split. Assume X has at most one 
zero which we may move to the third column. Permute the rows of X 
so that 

1,2. 



X 



^0. 



Then 



J 



so 



a contradiction. 



□ 



Theorem 2.5. Let n = 3. The set of critical points for the maps 4> c 
and (ph is, up to equivalence, the set 0(3) of 3 by 3 real orthogonal 
matrices. 

Proof. We consider the complex case first. Note that each Z £ 0(3) is 
a critical point of <ft c (see [T2| Lemma 3.7]) and as mentioned earlier, 
each X G U(3) which is equivalent to Z is also a critical point of <p c . 



cS 
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To prove the converse, consider an arbitrary critical point in de- 
phased form: 

£1,1 £1,2 £1,3 

£2,1 £2,2 + 2/2,21 x 2 , 3 + 2/2,31 

53,1 £3,2 + 2/ 3j2 i x 3i3 + 7/ 3i3 i 

Construct the 4 by 9 Jacobian (note that three columns are zero and 
can be ignored): 



J 



£i,i£2,i £1, 1^3,1 

^1,2^2,2 £1,2^3,2 

-21,1X2,1 x 2 ,ix 3A 

-^1,2^2,2 a 





-£1,22/2,2 


£1,22/2,2 





-£1,22/3,2 



p 



where a = x 2 ,2£3,2 + 2/2,22/3,2 and /3 = 2/2,2^3,2 - £2,22/3,2- Then all 4 by 
4 minors of J are 0. In particular 



J 



1,2,3,4 
1,2,4,5 



£l,l£2,l£3,l£l, 2 2/2,22/3,2 



0. 



We have two cases. 

Case 1: Z has a row with all entries not equal to 0. Then since Z 
is unitary it can be seen that it must have a column with all entries 
not equal to 0. After permuting rows and columns (independently) 
and dephasing, we may assume that all terms 2^1, > 0. According 
to the above minor, we see that at least one of 1/2,2 or y 3j2 must be 0. 
Then using the fact that the rows and columns of Z are orthogonal we 
conclude that the remaining y^j must also be and Z is in fact real 
and orthogonal. 

Case 2: Z has a zero in each row. Then since Z is unitary we see it 
must split. 

Now we consider the quaternionic case. By dephasing a 3 by 3 sym- 
plectic matrix W and then conjugating each entry by a fixed quaternion 
of unit norm so as to make the (2, 2) entry complex, we see that W is 
equivalent to a complex matrix Z. Thus if W is a critical point of (ph 
so is Z. By (12.31) we have that Z is also a critical point of <p c and thus 
is equivalent to a real matrix. □ 

We now take a look at the singular points of r : 0(4) — > A^. 

Theorem 2.6. Let n = 4. There are two types of singular points of the 
map (p r . The first are the matrices that split. The second type are the 
matrices which are equivalent to a matrix having all diagonal entries 
zero and all other entries nonzero. 
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Proof. By theorem 12.21 all orthogonal matrices that split are singular 
points. Also it is clear after a computation that matrices of the second 
type are also singular points. Assume X = [xij] G 0(4) is a singular 



point. The Jacobian of r at X is (ignoring zero columns): 



J 







£2,1^3,1 
£2,2^3,2 
^2,3^3,3 
-x 3 ,ix 2 ,i 

—^3,2^2,2 
—^3,3^2,3 

All 6 by 6 minors of J vanish. By evaluating 



Xl,lX 2 ,l 
£1,2^2,2 
£l,3^2,3 
-^2,1^1,1 
— ^2,2^1,2 
— ^2,3^1,3 







£1,2^3,2 
^1,3^3,3 





— £3,1X1,1 
—£3,2X1,2 
- X33X1 3 



Xl, 1X4,1 
£1,2X4,2 

^1,3^4,3 














£2,1X4,1 
£2, 2X4,2 

^2,3^4,3 














X3, 1X4,1 
£3,2X4,2 
£3,3^4,3. 



J 



1,2,3,4,5, A; 
1,2,3,4,5,6 



7,8,9, 



we obtain 

(2.4) Xi,iXi,2Xi, 3 X2,lX2,2X 3 ,fcX4,fcA 



3,4 
1,2 



X 



2,3,4 
1,2,3 



0. 



Notice that the last two factors are nested minors of X. Whenever it is 
possible to find two nested nonzero minors of X we may permute rows 
and columns to assume that they are 

'3,4\ , _/2,3,4 N 



X 



1,2 



and X 



1,2,3 



and then use the equations f)2.4p to conclude the existence of zero entries 
in X. Now we consider cases depending on how many zero entries are 
in X. If a 4 by 4 real orthogonal matrix has more than 4 zero entries, 
it splits. 

Case 1 : X has at most one zero entry. We may assume the zero entry 
is in the fourth column if it exists. Since X is orthogonal, we may find 
the required nested nonzero 3 by 3 and 2 by 2 minors amongst the first 
three columns of X. Then permuting the rows of X and the first three 
columns as necessary we may assume that the nested nonzero minors 
are in the top left corner of X. Then (12.41) gives a contradiction. 

Case 2: X has exactly 2 zero entries. If they are in the same 
row/column we may permute and take transpose as necessary to have 
them in the (3, 4) and (4, 4) positions and repeat the above argument 
to get a contradiction. Otherwise we place them in the (3, 3) and (4, 4) 
positions. Let the four rows of the first three columns be a, b, c and 
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d. We now must find a nonzero 3 by 3 minor of the first three columns 
containing row c. Assume all such minors are zero. Consider the minors 
using {a, b, c} and {b, c, d}. We have that 

c = aa + [3b = 76 + Sd 

for some a, /3,j,8 G M. We get 

aa + (j3 — 7)6 — 5d = 0, 

but since X is nonsingular, the {a, b, d} minor is nonzero and so a = 0. 
Thus c = (3b but clearly this is a contradiction since c has a zero entry. 
Now we may also find a nonzero 2 by 2 nested minor in the first two 
columns. Then equations (12.41) give a contradiction. 

Case 3: X has exactly 3 zero entries. Clearly they are not all in one 
row or column. If two are in the same row/column we may assume they 
are in the fourth column and we use the arguments from the previous 
case to get a contradiction. Thus assume the zeros appear in (2,2), 
(3,3) and (4,4). Let the four rows of the first three columns be a, b, 
c and d. We now must find a nonzero 3 by 3 minor of the first three 
columns containing rows b and c. Assume all such minors are zero. 
Then both a and d would be in span{6, c}, which is clearly impossible. 
As the nested 2 by 2 minor contained in rows b and c is nonzero, the 
equations (12. 4p give a contradiction. 

Case 4: By now we see that we may assume that the 4 zeros are on 
the main diagonal and we are done. □ 

Remark 2.7. If X 6 O(n) has zero diagonal, then all principal n — 1 by 
n — 1 minors of X vanish. This follows from Cramer's formula for the 
entries of X" 1 and the fact that X" 1 = X T also has zero diagonal. 



3. Orthostochastic Matrices 



Consider an aribtrary 3 by 3 bistochastic matrix: 



(3.1) 



x y * 
z w * 
a b * 



It is orthostochastic if and only if the vectors 

(V^, yfz, y/a) 



and 



are orthogonal for some appropriate choice of signs. That is 
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for some e$ = ±1. Squaring gives 



xy + zw + lex^/xyzw = ab. 

Substituting a = 1 — x — z and b = 1 — y — w and squaring again gives 

(3.2) (1 — x — y — z — w + xw + yz) 2 = Axyzw. 

We are thus lead to the following result of H. Nakazato [10]. Ap- 
parently he failed to notice that the defining equation of 3 can be 
written in this simple form. A similar expression appears in [6] as an 
inequality describing U 3 . 

Proposition 3.1. A 3 by 3 bistochastic matrix A3. 1\) is ortho stochastic 
if and only if it satisfies the equation A3, ty) . 

Since J 3 ^ 3 we find the distance between J 3 and the set 3 . 

Proposition 3.2. The distance from J 3 to 3 is v2/3 and the closest 
points on 3 are 



(3.3) 



1 

9 



1 4 4 
4 1 4 
4 4 1 



and all points obtained by permuting the rows and columns of this ma- 
trix. 



Proof. Let B be a bistochastic matrix as in (13.11) . 
minimum of the squared distance 

d(B, J 3 ) 2 = f(x,y,z,w) 



We look for the 



under the constraint (13. 2h . Using the method of Lagrange multipliers 
we get a system of 5 equations which Maple is able to solve. Up to 
permutations of the rows and columns, there are two stationary points. 
One is 

^1 2 l" 

2 2 
1 2 1 



1 

4 



at a distance of | from J 3 and the other is the matrix (I3.3P at a distance 
of V2/3 from J 3 . By using the Hessian test for optimization problems 
with a constraint, one can check that the first of these points is a saddle 
point and the second a local minimum. Note that the first point is the 
barycenter of one of the faces of £> 3 and so it is the closest point on 
dB 3 to J 3 . We can now conclude that / has absolute minimum at the 
second point. □ 
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This result is stated in [3]. It is mentioned to have been deduced 
from a similar problem involving minimizing entropy, which was solved 
numerically in [7]. 

Now consider an arbitrary 4 by 4 bistochastic matrix: 

x u p -k 

(3.4) y v q * . 

v 7 z w r -k 

a b c -k 

It is orthostochastic if and only if the vectors 

(Vx, Vy, y/z, Va), 
(y/u, eyy/v, e 2 Vw, e 3 Vb), 

and 

are orthogonal for some £j, z/j = ±1. The existence of e$ to make the 
first two vectors orthogonal is given by the equation 

sfxu + Eiy/yv + E 2 \fzw + e 3 Vab = 

which is equivalent to 

xu + yv + 1s\^/xuyv = zw + ab + 2e2£sV zwab. 

Then rearranging and squaring again gives 

(xu + yv — zw — ab) 2 = Axuyv + Azwab — IexEiE^ xuyvzwab. 

Finally squaring one last time and substituting a = 1 — x — y — z and 
6 = 1 — u — v — w gives 

(3.5) [(xu + yv — zw — (1 — x — y — z)(l — u — v — w)) 2 

—Axuyv — Azw(l — x — y — z)(l — u — v — w)] 2 

= Axuyvzw(l — x — y — z)(\ — u — v — w). 

The expressions for the other pairs of vectors are similarly obtained. 
For example the first and third columns can be made orthogonal if and 
only if the equation 

[(xp + yq — zr — (1 — x — y — z)(l — p — g — r)) 2 

—Axpyq — Azr(l — x — y — z)(l — p — g — r)] 2 

= Axpyqzr(l — x — y — z)(l — p — q — r), 

is satisfied. We obtain all together six equations by considering all 
pairs of columns. As well we obtain six more equations by considering 
pairs of rows. 



ORTHO-, UNI- AND QU-STOCHASTIC MATRICES 



13 



Proposition 3.3. // the matrix jS.Jjty is ortho stochastic matrix then 
h'J. 5\) and the eleven remaining equations hold. 

Remark 3.4. The problem of determining if these twelve equations are 
sufficient to ensure that a 4 by 4 bistochastic matrix is orthostochastic, 
is open. 

For an n by n bistochastic matrix X one can easily write a similar 
system S n of n(n— 1) equations. Let X' be the n by n matrix with X-j = 
y/Xij. An equation in S n vanishes if and only if the corresponding pair 
of columns or rows in X' can be made orthogonal by appropriately 
choosing signs for the entries. Thus an n by n orthostochastic matrix 
will satisfy S n . Then a natural question arises: Does E n characterize 

It is easy to see that the answer is negative in general. Indeed if 
n = 2 (mod 4) then J n satisfies S„. On the other hand, if also n > 2, 
Jn ^ On because there is no n by n Hadamard matrix. 

Moreover there are also counterexamples for all n > 16. First we 
prove a lemma. 

Lemma 3.5. For n > 1 there exist positive real numbers CLi , . . . , CL n 
such that ^2ai = 1, and the numbers 1 and ^fa^aj for 1 < i < j < n 
are linearly independent over the rational numbers. 

Proof. Choose positive real numbers £j, 1 < i < n — 1 which, are 
algebraically independent over Q, the rational numbers, and are such 
that EC 2 < !• Let 

n-l 

<H = £?, 1 < i < n - 1; a n = 1-^2 

i=i 

To prove the assertion, assume that 

r ° + r hiV a i a 3 = 

l<i<j'<n 

for some rational numbers r and r^j. This can be rewritten as 

n-l 

l<i<j<n—l i=l 

Squaring both sides and rearranging gives 

(ro + E - ( E r U*l f 1 - E = °- 

V l<i<i<n-l / \i=l /V i=l / 
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The coefficient of for 1 < i < j < n — 1 is r\ - + rf n . Now since 
the £j are algebraically independent over Q we have that r and r^- all 
vanish. □ 

Now we will construct a bistochastic matrix which satisfies S n but 
is not orthostochastic, using arguments from [TJ. For two matrices A, 
B we denote by AB the convex hull of {A, B}. 

Proposition 3.6. Let n > 16. 

(i) There exist X in B n satisfying E n which are not in O n . 

(ii) There exist a set S of permutations on n letters such that P a P T 
is contained in O n for any a, t in S , while the convex hull of S 
is not contained in O n . 

Proof. To do this we will need some definitions and the Hurwitz- Radon 
theorem [8]. Let A be an elementary abelian subgroup of order n = 16 
of S n . Choose some linear order < on A with the identity id being 
the smallest element and let a — > R a be the regular (permutation) 
representation of A. Let {a a } ae A be as in Lemma [3.51 and let 

X = a^Ro- G B n . 

aeA 

The rows and columns of X are labelled based on the linear order from 
A. One can see that X a ^ = a a p. 

Consider distinct columns a and j3 in X. For any a £ A there is a 
unique corresponding r G A such that era = t/3. Then X a ^ a = X Tj p and 
X Tja = X^/3 and it is easy to see that we may choose signs appropriately 
so that the corresponding columns of X' are orthogonal. A similar 
argument applies to the rows. Thus X satisfies S n . Now choosing signs 
for all entries in X' to produce an orthogonal matrix is equivalent to 
the existence of diagonal orthogonal matrices D a such that 

J2Va a R a G O(n), 

where R a = D a R a . Assume such diagonal orthogonal matrices exist. 
We can clearly assume that Ad = In an d -Rid = -Rid = In- Thus by 
orthogonality we have 

a-aR^Ro + \/a a a T (R„R T + R^R a ) = I n . 

<t£A (7<T 

The first summation simply equals /„. Thus since y/a^aj are linearly 
independent over Q, we get 

RlR T + RTRr = 0, a < t, 
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and in particular 

Rl + R a = 0, a > id, 

and 

RlR a = -Rl =I n , a> id. 

Thus the Hurwitz- Radon equations are satisfied. However the Hurwitz- 
Radon number p(16) = 9 is less than 15. Since the number of matrices 
that can satisfy the Hurwitz-Radon equations is always less than or 
equal to p(n), we have a contradiction. 

The group A immediately provides us with the set S for the second 
part of the proposition. All elements of A are commuting involutions 
and so by Theorem 14.41 each edge P a P T for a, r G A is orthostochastic, 
yet the matrix X constructed above is not. 

We then extend the result to all larger dimensions n > 16 by exten- 
ding each permutation a G A so that a(i) = i for i > 16. □ 

In pQ it is asked if a set of permutation matrices containing only 
commuting involutions has a unistochastic convex hull. We have thus 
shown in the above proposition that the answer is false in the ortho- 
stochastic case. Note that in that paper the term "orthostochastic" 
corresponds to our term unistochastic. 



4. Quaternion Hadamard Matrices 



An n by n quaternion Hadamard matrix is a matrix H G M n 
such that each entry has unit norm \H it j\ = 1 and H*H = nl n where 
denotes conjugate transpose, i.e., 77= is symplectic. The n by n 



Fourier matrix, F n , is the unitary matrix given by 

[-^n]zj ~ 



(i-l)Cj-l) 



, n 

where lo = e 2wl ^ n . Dephasing a 2 by 2 symplectic matrix gives a real 
matrix while dephasing and conjugating entrywise a 3 by 3 symplectic 
matrix yields a complex matrix. It follows that every 3 by 3 quaternion 
Hadamard matrix is equivalent to the Fourier matrix. Thus we will 
investigate the structure of 4 by 4 quaternion Hadamard matrices. 

Let us first introduce two families of such matrices. The 2-parameter 
special family of 4 by 4 quaternion Hadamard matrices consists of ma- 
trices of the form 



1 


1 


1 


1 


1 


-1 


b 


-b 


1 


a 


X 


z 


1 


—a 


y 


w 
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where a and b are unit norm quaternions of the form a 
b = bi + 6 2 j and 



(4.1) 



x = — -(1 + a + b — ab), 



V = -t^ 1 - a + b + ab), 



--(1 + a-b 



ai + a 2 i, 



ab) 



w 



— -(1 — a — b — ab). 



The 3-parameter generic family of 4 by 4 quaternion Hadamard ma- 
trices consists of the matrices of the form 



1 

a 
c 



1 

b 
d 

■1-b-d 



-1 - a 
-1-c 
l + a + b-\ 



-b 
-d 
c + d 



where 



l + a 
1 + al 



x- 



I + a 

II + a\ 



d 



l + a . 

X-. rl 

1 + a 



a / -1 and x are unit norm quaternions of the form a 
and x = x 2 i + X33, and a = a\ + a 2 i. 



ai + a 2 i + a 3 j 



Theorem 4.1. T/ie matrices in the special and generic families are 
Hadamard. 

Proof. We verified using Maple that these families indeed consist of 
Hadamard matrices. It is also possible to do the verification by hand. 

□ 

In the case n = 4 there are genuine 4 by 4 quaternion Hadamard 
matrices, and we see that this allows the set of qustochastic matrices 
to be larger than the unistochastic matrices. 

Theorem 4.2. U n is a proper subset of Q n if n > 4. 

Proof. Let n = 4 and consider the matrix H from the special family 
with a = i and y2b = 1 + j. Then the differential (d^)^ is a 9 by 36 
matrix which a computation shows to have rank 9. To be specific, the 9 
by 9 minor of (d<f)h)H made up from columns corresponding to basis vec- 
tors EH for E e {A 1A , A 2 ' 3 , A 2 ' 4 , A 3 - 4 , iC 1 - 4 , iC 2 < 4 , jC 1 ' 4 , jC 2 ' 4 , kC 3 ' 4 } is 
nonzero. Thus there is an open ball of qustochastic matrices in the 
Birkhoff polytope centered at the van der Waerden matrix J 4 . This is 
not the case for unistochastic matrices for n = 4 as seen in [5]. Thus U4 
is a proper subset of <2 4 . The result easily extends to the cases n > 4 
by observing that the matrix 4>h{X) © J n _ 4 is in Q n \ U n if X G Sp(4) 
is chosen so that 4>h(X) ^ W 4 . □ 
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Example 4.3. To see that Q n is a proper subset of B n for n > 3 simply 
note that 

^1 



1 
2 




1 
1 




1 
1 




are bistochastic, but obviously not qustochastic. This argument is the 
same as the one used in [21 Theorem 5]. 

For a permutation a in the symmetric group on n symbols, we let 
P a be the n by n permutation matrix with entries (P a )i,j — & 
map a — > P a is a group homomorphism. 



The 



Theorem 4.4. // P CT P r C Q n then a V is an involution. Conversely, 
if o~~ x t is an involution, then P a P T C O n . 



Proof. Assume that P a P T Q Qn- It is easy to see that the matrix 
A = | (-Po- + P T ) is, up to permutation of the rows and columns, a 
direct sum of blocks 



of various sizes. Since A e Q„, only blocks of size 1 or 2 occur. This 
implies that cr _1 r is an involution. Conversely if a~ x r is an involution 
then any matrix pP a + (1 — p)P T for < p < 1 is (up to permutation 
of rows and columns) a direct sum of blocks 



or 



p 

1 — p 



1 — p 
p 



and is easily seen to be in O n . 



□ 



These arguments are similar to those found in [T]. 

Now we will show that the special and generic families comprise all 4 
by 4 quaternion Hadamard matrices up to equivalence. We begin with 
a lemma: 

Lemma 4.5. Let u = [l,a,x,y] T be a 4 by 1 quaternionic column 
vector with entries of unit norm. Assume u is orthogonal to the column 
vector with all entries equal to 1 and a ^ 1. Then 

1 + a , 1 + a 



(4.2) 



x 



h 



y 



-h 



where h is orthogonal to 1 + a and 2\h\ 
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Proof. Let h = x + Note that 1 + a = — x — y, 2h — x — y 

and |x| = |y| = 1 so /t J_ 1 + a. Then |/i — i^| 2 = \x\ 2 = 1 so 
\h\ 2 = 1- |±±^| 2 = |±^| 2 since l + oll-o. Thus 2\h\ = \l-a\. The 
expression for y is straightforward. □ 

Theorem 4.6. E'ac/i 4 by 4 quaternion Hadamard matrix is equivalent 
to one in the special or generic family. 

Proof. After dephasing a 4 by 4 quaternion Hadamard matrix we have 
one of the form 

"l 1 1 l" 

1 a d g 
1 b e h ' 
1 c f i_ 

If one of the nine entries a, b, is real, it must be ±1. When there 
is a real entry there must be a —1 entry also. Let us assume there is 
a real entry. After permuting rows and columns we obtain a matrix of 
the form 

"l 1 1 1 " 

1-16-6 
1 a x z 
1 —a y w 

We may assume that a G C and b G span{l,i,j} after conjugating. 
Using orthogonality with the first row and column, and solving for x, y 

and z, we get 

x = — b — a + w, y = —l + a — w, and z = — 1 + 6 — w. 

Then using orthogonality between the second and third columns we 
get the equations (14. ip . Now since |x| = \z\ = 1 we have that 1 + a _L 
(1 — a) 6 or a pu _L 6 pu . Thus 6 G span{l, j} as required. 

Suppose we now have a dephased quaternion Hadamard matrix hav- 
ing no real entries apart from those in the first row and column. By 
lemma 14.51 the matrix is of the form: 

"11 11" 

yj- 1 a 6 c 

1 h-(l + a)/2 fc-(l + 6)/2 * 

1 -h-(l + a)/2 -jfe-(l + 6)/2 * 

where \a\ = |6| = |c| = 1; 2\h\ = |1 — a\, 2\k\ = \1 — b\ and h and k are 
orthogonal to 1 + a and 1 + 6 respectively. The orthogonality condition 
on the second and third columns gives 



(4.3) 



- Ahk = (1 + a)(l + b) + 2(1 + ab). 
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As h _L 1 + a and k _L 1 + b, we have 

h = s(l + a), k = t(l + b) 
where s and t are pure quaternions with 
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2|s| 



2\t\ 



1 + a 

Substituting into equation (14. 3p we get 

1 + a 



1-b 



(4.4) 



Ast = 1 + 2 



|l + a| 



(1 + ab) 



1 + b 

1 + 6 
1 + 6|< 



Note that c = -(1 + a + b) so |1 + a + 6| 2 = 1 which is 

2 + a + a + b + b + ab + ba = 
and is equivalent to 1 + a _L 1 + 6, and to 

(4.5) (1 + ai) (1 + 6 a ) + a 2 6 2 + a 3 &3 + ^4 = 0. 
We have 

(l + o)(l + a6)(l + 6) = 2 + a + a + b + b + ab + ab 

= ab — ba. 

Now \a\ 2 = \b\ 2 = 1 implies that |1 + a| 2 |l + b\ 2 = 4(1 + oi)(l + h). 
Since 

a pn b pu = {a\ — a)(b — b\) = — axb\ + a\b + b\a — ab, 
b pn a pn = {b - b 1 )(a 1 - a) = -a^i + a x b + b{a - ba, 

and 

qPU^PU + 6 pu a pu = _ 2 ( g p^ foP u ) = 2(1 + Oi)(l + &l), 

we have 

2b pu a pu = (2(l + ai)(l +h) - a pu b pu ) + b pu a pu 
= 2(1 + ai)(l + 6i) + 06-60. 
Hence ( 14.41) can be written as 

(4.6) st = uv 
where 



b pu 



pu 



U 



2(1 + 60' 2(1 + ai)' 

It is straightforward to check that \s\ = \v\ and \t\ = \u\. Equation 
(14.61) is equivalent to 

(s, t) — (u, v ), s x t = u x v. 

Thus span{s, t} = spanjw, v} and the angle between s and t equals the 
angle between u and v. Hence there is a rotation in the space of pure 
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quaternions that takes v to s and u to t, thus there is a unit norm 
quaternion x such that 

s = xvx, t = xux. 
Then ( 14. 6 j) gives that xt> mi = -ut> which after rearranging becomes 



(4.7) 
where 



vu 
\vu\ 



= y 

a pn b pn 



If we plug into H the expressions for s and t and switch the last two 
rows, we obtain the matrix: 



1 

a 

l+a = 1+q 
X |l+a| X |l+a| 

-k 



1 

b 

1+6 ^ 1+6 
X |l+6| x |1+6| 



Now since we are free to conjugate the matrix entrywise by a unit 
norm quaternion, we may assume that a = ai+a^i+aaj and b = fei+^i. 
Let a = ai + a 2 i. Since we assumed that a, b G" M., equation (14. 5p implies 
that a 2 7^ and so 

(l + aOa + ftx) 



«2 



Then from \b\ 2 = 1 we get 



((1 + ai) 2 + al)b\ + 2(1 + ai) 2 6i + (1 + a^ 2 - a 2 = 0. 



One of the roots of this quadratic is b\ 
Thus we must have that 



-1, which is not the case. 



;i + q 1 ) 2 -a 2 
(1 + ai) 2 + a 2 



and a simple computation gives 



b=- 



[1 + a] 



l + o 



2 ' 



If 03 = 0, then a is complex and the above formula shows that b = —a 
and c = — 1 which gives a contradiction. We conclude that a 3 ^ 0. 
Since a pu 6 pu = (a 2 i+a 3 j)6 2 i = — &2(o2 + «3k) and a 3 6 2 7^ 0, the equation 
xyx^ 1 

particular x = —x. As 



y implies that xkx 1 = —k. Thus x G span{i,j}, and in 



11 + d| 



:i + oi 



-2a 2 (l + a)i, 
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we have 



1 + 6 
T+b\ 



± 



1 + a . 
-i 

1 + a 



and we arrive at the result. 



□ 



5. QUATERNIONIC MUB 

Two orthonormal bases {ui}^ =1 , {vi}™ =1 in C n are said to be mutually 
unbiased bases (MUB) if 



for all 1 < i,j < n. A collection of orthonormal bases for C n is said 
to be a set of mutually unbiased bases if they are pairwise mutu- 
ally unbiased. We extend this definition in the natural way to HP. 
We consider H" as the right vector space over H consisting of col- 
umn vectors. We use the physicists' notation \x) to denote a column 
vector and (x\ to denote the conjugate transpose vector \x)*. The 
space HP is equipped with the standard positive definite inner product 
(x\y) = \x)*\y). We identify an orthonormal basis with any symplectic 
matrix that has as columns the vectors in the basis. The condition 
that a set S = {Ai, . . . , A k } of symplectic matrices be mutually unbi- 
ased is simply that y/nA*Aj is Hadamard for all 1 < i < j < k. If 
a set of MUB contains only Hadamard matrices (more precisely, ma- 
trices Ai such that \JnAi is Hadamard), we may include the identity 
matrix to create another set of MUB. Conversely if {Ai, . . . , Ak} is a 
set of MUB, then {I, A\A 2 , A\A k } is a set of MUB, where I is 
the n by n identity matrix and yfnA\Aj is Hadamard. Two sets of 
MUB {A , Ai, . . . , A k }, {B , Bi, . . . , B k } are to be considered equiva- 
lent if there exist symplectic monomial matrices Ei for < i < k and 
a symplectic matrix U such that UAiEi = B n ^ for all < i < k where 
ii is a permutation of {0, ... , k}. If a set of MUB contains the identity 
matrix, we may then additionally dephase some other matrix in the 
set. 

It is known that the upper bound on the size of a set of MUB in C™ 
is n + 1 |3] . For HP this is not the case, see Theorem 15.31 
We give an upper bound: 

Theorem 5.1. For n > 2, a set of quaternionic MUB in H n consists 
of at most 2n + 1 bases. 

Proof. Let H n be the real vector space of all Hermitian quaternionic n 
by n matrices, and H° its subspace consisting of zero trace matrices. 




1 



n 
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Note that dimHI° = 2n 2 — n — 1 = (n — l)(2n + 1). The inner product 
in H n is given by 

(A,B) = Tt(AB), 
where for any n by n quaternionic matrix X we define its trace by 

n 

TrX = 2^Re(X ii ). 

i=l 

Let £ = {|ei), |e„)} be an orthonormal basis of HP. To each unit 
vector |e) G HP we associate the operator 

\e}(e\--I n eU° n . 
n 

To the basis £ we associate the (n — l)-dimensional subspace Vs C H° 
spanned by the matrices 

Ei = |ej)(ej| - -J n , 1 < i < n. 
n 

Observe that the sum of these matrices is and they are vertices of 
a regular {n — l)-simplex in H°. Let T = {fx, /„} be another or- 
thonormal basis of HP and assume that £ and T are mutually unbiased. 
If 

F j = \f j )(f j \-l;In, !<3<n, 

then we have 
(E^Fj) = Tr (EiFj) 

= 2|(e l |/,)| 2 -^ 
= 0. 

Consequently Vs _L Vp. Since each of Ve , V^, ■ ■ ■ has dimension n — 1 
and dimHI° = (n — l)(2ri + 1), the assertion of the theorem follows. □ 

Remark 5.2. This proof is an obvious adaptation of the proof in the 
complex case sketched in the paper [3]. 

A set of 2n + 1 MUB in HP will be called complete. A set of MUB is 
maximal if it cannot be extended to a larger MUB. We shall see later 
that there exist maximal MUB which are not complete. Next we show 
that the above bound is attained in the case n = 2. 



-iTr(|e J )(e i |)--Tr(|/ J )(/ J |) + - 
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Theorem 5.3. For n = 2 there exists a complete set of MUB in HP. 

Moreover, it is unique up to equivalence. The matrices 



multiplied by 1/ \[2 and the identity matrix form such an MUB. 

Proof. Assume {I, A u A 4 } is a set of MUB. Thus V2Ai and V2A*Aj, 
i 7^ j, are 2 by 2 Hadamard matrices. After dephasing we may assume 



V2A l 



1 



If n > 2 we multiply A-i on the right by an appropriate symplectic 
diagonal matrix to get 



V2A 2 = 



for some unit norm quaternion a. Then since \/2A*A2 is Hadamard 
we have that |l + a| = |1 — a| = 1 so a is pure. We may conjugate by 
x E H to make a = i. Next we have 



V2A 3 



1 1 

b -b 



for some unit norm quaternion b. Again b must be pure but also we 
see that ib is pure. Thus b e span{j,k}. Conjugating by a complex 
number we have b = j. Finally, A4 has a similar form involving a unit 
norm quaternion c. Now we have c, ic and jc must be pure. This means 
c = ±k and if necessary we may switch the two columns of A4 to have 
c = k. A computation shows this is indeed a set of MUB. □ 

Now we consider the problem with n — 3. Let us introduce six 
families of quaternionic 3 by 3 Hadamard matrices which are unbiased 
with respect to the Fourier matrix F3. All these claims are easy to 
verify by using Maple. All of the families consist of matrices with the 
same basic form 



(5.1) 

where \a\ 



1 1 1 

a a( a( 2 
b bC b( 



\b\ = 

c = 



1 and 
1 



+ si + tj, s 2 + t 2 = 3/4; s,te 
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Let 

(5.2) p(a, s, t) = (a 2 , + a 2 )s + (aia 4 — a 2 a 3 )t 
and 

(5.3) ip(a, s, t) = 4a s 2 + 8aist + a 2 



a = 1 — ai + 4aia 2 + 2aia\ + 2020304 — 2a| — 2a 2 , 



where 

«o = - 

(5.4) ai = o 2 04 — a 2 a4 + 2010203 — 0104 + 0203, 
«2 = 1 — ai + 4aia 2 + 4aia| — 20104 — 6020304. 

First there is a 3-parameter family, to which we refer as the generic 
family, consisting of matrices (15. ip where <p(a,s,t) = 0, p(a,s,t) 7^ 0, 
and b is the unique solution to 

(5.5) (1 + ^,1 + ^(1-1, i,i6{0,l}. 

There are also five special families, each depending on two parame- 
ters. It should be understood that, in each case, only the listed restric- 
tions apply in addition to \a\ = \b\ = \(\ = 1. 



1 



a 



1, b\ = — ~ and 6 4 = 0. 



a = C, C 2 ; bi = 1 - 2(a 2 6 2 + a 3 b 3 ) and 6 4 = 2(a 3 6 2 - 02^3)- 
a = u, u) 2 ; bi = 1 — 2a2&2 and 64 = 2a2& 3 . 

°i = 1 5 a 2 = a 3 = ^> a 4 = 4a20 3 , bi — — I — 10464, 63 = ^-(1 — 
46i — 80262) and s = 0. 
5: 30 2 , = (1 — Ox) 2 , 6o| = (1 — Oi)(l + 2ox), a\ = 3o|, p(a,s,t) = 0, 



1 2 a 3 ' z 2a 2 a.2 2a 3 J 

The next theorem reveals the importance of these families for the 
problem of constructing a maximal set of MUB. We shall give two 
examples at the end of this section. 

Theorem 5.4. Let n = 3. Any quaternionic MUB {Aq, A\, A2} is 
equivalent to one where Aq = 1%, A\ = F% is the Fourier matrix, and 
A = y/3A 2 belongs to one of the six families defined above. 

Proof. We may assume that A = I 3 . Since any 3 by 3 quaternion 
Hadamard is equivalent to V0-F3, we may assume A\ = F$. We may 
also assume that the Hadamard matrix A has the form 



.4 



1 1 1 

ace 
b d f 
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Let x be a unit quaternion such that x 1 a 1 cx is complex, let X 



diag(x, x, x) and let 



Then we have that 



D 



a 





-i 








b- 1 



X~ X DAX 



1 1 1 

1 x~ 1 aT 1 cx x~ 1 cT 1 ex 
1 x~ 1 b~ 1 dx x~ 1 b~ 1 fx 



is Hadamard and by orthogonality we see that all entries are complex. 
By having permuted the columns of A if necessary, we see that the RHS 
of this equation must be V3F 3 . Then we have A = \^3D ~ 1 XF 3 X~ 1 . 
Thus if we let ( = xux~ l we get that A is in the form (15.11) . 

Now since A2 must be unbiased with respect to F3 we have that 
equations (I5.5P hold. If we take a and ( to be fixed, we obtain a linear 
system of equations Bb = v where B is a 4 by 4 real matrix and v is a 
real column vector. The entries of B and v depend on a and (. Note 
that since J3 and F3 are complex, we are free to conjugate A entrywise 
by a complex number of unit norm, and thus we assume that ( _L k. 

Let di be the determinant of the 4 by 4 matrix obtained by dropping 
the i th column of the augmented matrix B = [B\v]. It can be checked, 
e.g. with Maple, that 

(5.6) g?5 = 3p(a, s, t) 2 , 

and 



(5.7) 



2 -4 



= 9d 5 (p(a, s, t) 



where the congruence is modulo the ideal (\a\ 2 — 1, |C| 2 — 1). 
Now if det-B = ^ ^ then by Cramer's rule we have bi - 



±di/d 5 



and since |6| = 1 we get that ^2 i=1 d 2 



d 2 . 



From and ([LTD it 



follows that p ^ and if — 0. Thus A is in the generic family. 

If d 5 = then (I5.6P implies p(a, s, t) = 0. Note that by (15.71) we have 
then that di = for 1 < i < 4. We now distinguish three cases. 

Case 1: 03 = a 4 = 0. Rows 1 and 3 of B reduce to the system 

(1 + ai)bi + a 2 b 2 = — ai, 
a 2 h + (1 - ai)& 2 = 0-2- 



Since 



1 + di a 2 
a 2 1 — di 



0. 
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the two equations must be linearly dependent and so ai — a\ + a 2 = 
a 2 (l + 2ai) = 0. Thus either a 2 = and so a = 1, or ai = — | and 
a = uj or c<j 2 . If a = 1 then we see that b± — — \ and row 4 gives either 
t = or 64 = 0. If t — then £ is complex and by conjugating A 
entrywise by a complex number we can assume 6 4 = 0. Thus we get 
that A is in Family 1. If a x = — \ then we see that by rows 1 and 2 we 
have b\ + 2a 2 b 2 = 1 and t(3b 3 — 2a 2 5 4 ) = 0. Thus we get that t = 
or 64 = 2a 2 b 3 . If 64 = 2a 2 6 3 then A is in Family 3. If t — then £ is 
complex and by conjugating A entrywise with a complex number we 
see that A is in Family 2. 

Case 2: a\ + a\> 0; aia 4 — a 2 a 3 = 0. Then s = 0. By calculating 
a Groebner basis for the ideal (s, a,\a± — a 2 a 3 , |C| 2 — 1, |a| 2 — 1) one 
may check that all 3 by 3 minors of B are 0. Thus all 3 by 3 minors 
of B have to be as well. After computing a Groebner basis for the 
ideal generated by the generators of the previous ideal, along with the 
remaining minor equations, one sees that a 4 G {0, ±|}. If a 4 = 
then a 3 7^ so a 2 = 0. Clearly a\ 7^ 1 since a\ + a\ > 0, thus we 
see that a\ = —\. Solving the system we get b\ = 1 — 2a 3 6 3 and 
64 = 2a 3 6 2 thus A is in Family 2. If 04 = ±| then ai = | and a 2 = ^. 
Furthermore Cl 2 — ^Cl 3 Cl4. Then solving the system we find from row 1 
that b x = -\- ^a 4 b 4 and then b 3 = ^f-(l - 4&i - 8a 2 b 2 ). Thus A is in 
Family 4. 

Case 3: a\ + a\ > 0; 0404 — a 2 a 3 7^ 0. Then s ^ and t ^ 0. By 
calculating a Groebner basis for the ideal (p(a, s,t), \(\ 2 — 1, \a\ 2 — 1) 
one may check that all 3 by 3 minors of B are 0. Thus all 3 by 3 
minors of B have to be as well. Collect all 3 by 3 minors of B and 
append the polynomials \(\ 2 — 1, \a\ 2 — 1 and p(a,s,t), and compute 
the Groebner basis of the ideal J they generate. By using this basis it 
is easy to verify that the set of polynomials 

{a 4 t(3ag — a 2 ), a 3 (t 2 — a 2 — a 2 ), t(3a 2 — 04) (1 + 2ai), 

4a 3 (st — a 2 a 3 ) + a4(a 3 + a 2 — 4a 2 a 3 ct4), ct4t(a 2 a4 — a 3 (l — ai)), 
t(a 2 (l + 2ai) - 2a 3 a 4 )}, 

is contained in J. From a 4 t(3a| — a 2 ) = we deduce that a 3 7^ and 
that either 04 = or a\ = 3a§. If 04 = we obtain that a\ = — \, 
st = a 2 a 3 , and t 2 = a\. Thus s 2 = a 2 , and a = ( or £ 2 . In both cases 
we get b\ = 1 — 2(a 2 b 2 + a 3 6 3 ) and 64 = 2(a 3 6 2 — a 2 b 3 ). Thus A is 
in Family 2. If a 4 7^ then a 2 = 3a|, t 2 = 4a|, a 2 a4 = a 3 (l — ai) 
and 2a 3 a 4 = a 2 (l + 2ai). From the last two equations we find that 
2a 2 = (1 - ai)(l + 2ai) and 

3a 2 (l + 2ai) = 6a 2 (l - ai) = 2a 2 (l - a t ) = (1 - ai) 2 (l + 2ai), 
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which gives 3a 2 , = (1 — a-i) 2 . Now rows 1 and 2 of system B give 



l 

2 



a 3 



1 — ai 

2ao 



0363 

a 2 2a 3 



Thus A is in Family 5. 



□ 



As promised, we now give two families of genuine quaternionic MUB 
consisting of four bases (i.e., symplectic matrices). The first one de- 
pends on one parameter only, while the second depends on three pa- 
rameters. 

Example 5.5. We give here a one parameter family of quaternionic 
MUB of size 4 which includes the complete complex MUB as a par- 
ticular case. They are parameterized by the points (s, t) on the circle 
s 2 + t 2 = 3/4 and are given by 



where 



- + si 



c 1 1 

A(s,t)= 1 C 1 , C 

} 1 c 

For t = 0, s = V3/2 this is a complex MUB. 

We have verified using Maple and Theorem 15. 4[ that such MUB, for 
arbitrary (s, t) is not extendible to an MUB of size 5. 

Example 5.6. We now give a normalized 3-parameter family of max- 
imal quaternionic MUB: 

h, F 3 , ^ A ( a , b , c ), ^ S ( a ' & ' c )' 

The a, b, c are any quaternionic cubic roots of unity orthogonal to k. 
Thus each of them has the form 



-~ + si + ty, 



s 2 + t 2 



3/4. 



The Hadamard matrices A and B belong to our family 1 and are given 
by 



A(a, b, c) 



1 1 
1 a 
b ba 2 



a 
ba 



B(a, 6, c) 



1 1 1 

1 c c 2 
b be 2 be 



The family is normalized in the sense that we have fixed the first two 
matrices to be J3 and F 3 . 

It is not hard to verify that the four orthonormal bases provided by 
the columns of these matrices are indeed MUB. For example if we take 
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the conjugate transpose of the second column in A and multiply by the 
second column of B we get the sum 1 + a 2 c + abc 2 . By multiplying on 
the left and right by a 2 and c respectively, this sum has the same norm 
as a 2 c + ac 2 + b = 2(a 2 c 2 + a 3 c 3 ) + b 2 i + b 3 j + 2(a 3 c 2 - a 2 c 3 )k. The 
norm squared of this sum is easily seen to be 3 as required. If we take 
the special case a = b = c = (, we obtain a 1-parameter family which 
is equivalent to the one from the previous example. 

6. Open Problems 

Here we mention some relevant problems that remain open. 

Problem 6.1. For arbitrary dimension n, real, complex or quater- 
nionic, characterize the singular points of <p r : 0(n) — > A n and the 
critical points of (fi c : U(n) — > A n and <ph '■ Sp(n) — > A n . 

The singular points of <p r for n < 4 only depend on the positions of 
zero entries in the orthogonal matrix. It would be interesting to see if 
this phenomenon persists in higher dimensions. 

Problem 6.2. Is it true that the twelve polynomial equations £4 sat- 
isfied by C?4 characterize this set? 

Problem 6.3. Determine all maximal sets of MUB in H 3 . 

We have found a 3-parameter family of such MUB consisting of four 
bases. This family contains the maximal set of complex MUB that is 
unique up to equivalence in C 3 . 

Problem 6.4. Describe the set of all quaternionic Hadamard matrices 
of size 5 or 6. 

We have found two families of quaternionic Hadamard matrices of 
size 4 and we have shown that any quaternionic Hadamard matrix of 
size 4 is equivalent to a matrix in one of these families. 

Problem 6.5. Prove that the maps <p c : U(n) — > A n and <ph : Sp(n) — > 
A n have regular points for all n > 1. 

The assertion is easy to verify for small values of n and it is known 
to be true if n is prime [12J. This problem is mentioned in [5]. 



ORTHO-, UNI- AND QU-STOCHASTIC MATRICES 



2!) 



References 

[1] Y-H. Au-Yeung and C-M. Cheng, Permutation matrices whose convex combi- 
nations are orthostochastic, Linear Alg. Appl. 150 (1991), 243-253. 

[2] Y-H. Au-Yeung and Y-T. Poon, 3 by 3 orthostochastic matrices and the con- 
vexity of generalized numerical ranges, Linear Alg. Appl. 27 (1979), 69-79. 

[3] S. Bandyopadhyay, P. O. Boykin, V. Roychowdhury and F. Vatan, A new proof 
for the existence of mutually unbiased bases, Algorithmica 34 (2002), 512-528. 

[4] I. Bengtsson, W. Bruzda, A. Ericsson, J. Larsson, W. Tadej and K. Zyczkowski, 
Mutually unbiased bases and Hadamard matrices of order six, J. Math. Phys. 
48, 052106 (2007). 

[5] I. Bengtsson, A. Ericsson, M. Kus, W. Tadej and K. Zyczkowski, Birkhoff's 
polytope and unistochastic matrices, N=3 and N=4, Communications in Math- 
ematical Physics 259 No.2, (2005), 307-324. 

[6] P. Di^a, Separation of unistochastic matrices from the double stochastic ones: 
Recovery of a 3 x 3 unitary matrix from experimental data, J. Math. Phys. 47, 
083510 (2006). 

[7] H.G. Gadiyar, K.M.S. Maini, R. Padma and H.S. Sharatchandra, Entropy and 

Hadamard matrices, J. Phys. A 36 (2003), L109-L112. 
[8] TY. Lam, Introduction to quadratic forms over fields, American Mathematical 

Society, 2004. 

[9] Maplesoft, Waterloo Maple Inc., Maple 9.01, 2003. 
[10] H. Nakazato, Set of 3 x 3 orthostochastic matrices, Nihonkai Math. J. 7 No.2, 
(1996), 83-100. 

[11] W. Tadej and K. Zyczkowski, A concise guide to complex Hadamard matrices, 

Open Sys. & Information Dyn. 13 (2006), 133-177. 
[12] W. Tadej, K. Zyczkowski and W. Slomczyhski, Defect of a unitary matrix, 

arXiv: \math/07025ld\ vl 17 Feb 2007. 

Department of Pure Mathematics, University of Waterloo, Water- 
loo, Ontario, N2L 3G1, Canada 

E-mail address: ochteren@uwaterloo.ca 
E-mail address: djokovic@uwaterloo.ca 



